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Abstract 

In a recent paper (Asci et al., 2008) it has been shown that certain 
random continued fractions have a density which is a product of a beta 
density and a hypergeometric function 2F1. In the present paper we fully 
exploit a formula due to Thomae (1879) in order to generalize substan- 
tially the class of random continuous fractions with a density of the above 
form. This involves the design of seven particular graphs. Infinite paths 
on them lead to random continued fractions with an explicit distribu- 
tion. A careful study about the set of five real parameters leading to a 
beta-hypergeometric distribution is required, relying on almost forgotten 
results mainly due to Felix Klein. 

Keywords: Distributions on (0,1) with five parameters. Gen- 
eralized HYPERGEOMETRIC FUNCTIONS. PERIODIC RANDOM CONTINUED 
FRACTIONS. AMS classification: 60J05, 60E05. 



1 Introduction 



Recall that 2-F1 is the hypergeometric function defined as follows: the sequence 
of Pochhammer's symbols {(t) n }^°_ is given for any real number t by (t)o = 1 
and (i)n+i = {t+n)(t) n . For real numbers p, q, r such that — r ^ N = {0, 1, 2, . . .} 
and for < x < 1 the number 2F1 (p, <?; r; x) is the sum of the power series 

V (P)n(q)n n 

71=0 

Let v — (a,b,p,q,r) T be in M. 5 , where T means transposition. Consider the 
function 

h(a, b,p, q, r; x) = x -^ 1 (1 - xf^ 1 2 F\ (p, q;r;x) , < x < 1, (2) 

and suppose that it is non negative and integrable on (0,1). The distribution 
BH{v) of X G (0,1) with density proportional to h(a,b,p,q,r;x) is called a 
beta hypergeometric distribution. An important example is 

BH(1, 1, 1, 1, 2){dx) = 4- log l ( o,i) (3) 
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If c, d > the probability measure 

1 °~ ^ 

^ {dw) = B(c,d) (1 + W )^ 1 ^)^ dw 

is called a beta distribution of the second kind. For instance if U ~ 7 C and 
V ~ 7d are independent where the gamma distribution is 
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7 c (dw) = ■ F? - T w c x e tl l (0iOo) (w)c!'u; 



f 2") 

WO have ~ Consider now two independent random variables X ~ 

BH(v) and W ~ /3^, , , where a' = r — a and b = a + b — p. Also denote 
p' = r + b—p — q, q' = b, r' = r + b — q. Therefore passing from v = (a, b, p, q, r) T 
to v' — (a', b' ,p' ,q' , r') T is a linear automorphism of M 5 : 

v i y v — Mv 

where M is a 5 x 5 matrix (see (|23p ). Similarly passing from v = (a, b,p, q, r) T 
to 

Hv = (a + b — p ,r — a) 

is a linear operation. The present paper is based on the crucial fact that the ran- 
dom variable (1+WX)^ 1 £ (0, 1) has a density proportional to h(a' , b',p',q',r';x). 
In other terms we have the following 

(2) 

Basic identity: Let X <~ BH(v) and W <~ /Jfj^ be independent. Then 

' BH(Mv). (4) 



1 + WX 



If X ~ BH(v), U ~ 7b' and ~ j a ' are independent an other presentation of 
the result is saying that v + ux ~ BH{Mv). The proof of the basic identity will 
be given by using the formula obtained in 1879 by Thomae, which is an identity 
concerning the generalized hypergeometric function 

^(AB,C-,D,E ;x) = ± ^l^>y . (5) 

Around this isolated basic identity, various considerations arise: 

What are the beta hypergeometric distributions? The exact knowl- 
edge of the acceptable set of parameters v = (a, b,p, q, r) T such that the distri- 
bution BH(v) exists needs a careful study of the positivity on (0,1) of 2-Fi- 
We rely for this problem on a 1890 paper by Felix Klein studying the number 
of zeros of 2F1. Section 2 is devoted to a detailed description of a set P of 
parameters v = (a, b,p, q, r) T such that the distribution BH{v) exists. Actually 
the BH(vYs exist on a set of v's which is slighly larger than P; this larger set 
has an involved description and is not really useful to our purposes. 

Identifiability of the beta hypergeometric distributions. Things are 
complicated by the fact that the same beta hypergeometric distribution can be 
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generically represented in four ways, first because of the symmetry (p, q) and 
more importantly by the existence of the Euler formula for 2 -Pi : 

2-Fi (p, q; r; x) = (1 - x) r_p_,J 2-P1 (r - P, r - q; r; x) . (6) 

(see Rainville (1960) page 60). In other terms consider the two 5x5 matrices 
T and S defined respectively by (a,b,q,p,r) T = T(a,b,p,q,r) T and (a, b + 
r — p — q,r — p,r — q,r) — S(a,b,p, q,r) T . The symmetry between p and q 
implies BH(v) = BH(Tv) and the Euler formula implies BH(v) = BH(Sv). 
Actually the group generated by T and S has four elements {J5, T, S, TS} and 
is isomorphic to (Z/2Z) 2 . Therefore the same beta hypergeometric distribution 
BH(v) of X can be generically represented in four ways by v, Tv, Sv and TSv. 
On the other hand when v — (a, b,p, q, r) T we have 

(a + b — p, r — a) = lit; = TLTSv, (a + b — q,r — a) = HTv = HSv. 

This implies that given v there are only two ways, and not four, to use the basic 
indentity: 

(2) (2) 

Basic identity, revisited: Let X ~ BH(v) and W ~ /3 n j = PyitSv ^ e 
independent. Then 

1 



1 + WX 



BH(Mv) = BH(MTSv) 



(2) (2) 

If X ~ BH(v) and ~ /^nTv — ^ri5^ arc independent, then 

1+ ] vx - BH(MTv) = BH(MSv). 

The reader can immediately verify that MTS = TM and MS = TMT. 

Explicit distributions of some random continued fractions. The 
iteration of this basic identity leads to various types of random continued frac- 
tions which are described in Section 4. They are splitted in seven categories 
corresponding to the seven partitions of the number 5, which are 

5, 4 + 1, 3 + 2, 3 + 1 + 1, 2 + 2 + 1, 2 + 1 + 1 + 1, 1 + 1 + 1 + 1 + 1. (7) 

For instance as an application of Theorem 4.2 we see that since v = (1,1,1,1, 2) T 
is an eigenvector of M for the eigenvalue 1 and that liv — (1,1). Therefore if 
the W„'s with n — 1, 2 . . . are iid such that Pr(0 < W n < w) = jf^ (that is if 

W n ~ fifl) we get that 

1 



X = 



1 -L- Wl 
1 ' , , Wo 



has distribution BH(v) given by This example belongs to the partition 5. 

To sum up, Section 2 is a thorough study of BH distributions, Section 3 
proves the basic identity and introduces a crucial reparameterization of the 
BH laws for which in particular symmetries due to the Euler formula appear 
clearly. Section 4 applies the previous material to random continued fractions 
after introducing the graphs associated to a BH law. This section contains 
Theorem 4.2, the main result of the paper. Finally Section 5 is a detailed 
study of the cycles of the graphs defined on Section 4, which makes simple the 
application of Theorems 4.2 and 4.3. 
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2 Beta-hypergeometric distributions 



This long section (which can be skipped in first reading) describes the class of 
probability densities on (0, 1) with five real parameters (a, b,p, q, r) called beta 
hypergeometric densities. They have the form x i— > Ch(a, b,p, q, r; x) where h is 
defined by @ and where C is a suitable constant. To achieve this description of 
the beta hypergeometric densities we have to answer to the following questions 

1. Positivity problem: when 2 F\ (p,q;r;x) > on (0, 1)? The set of ac- 
ceptable (p, q, r) will be denoted by P. 

2. Integrability problem: if h = h(a,b,p,q 1 r;x) > when do we have 

hdx < oo? We shall define at the end of Subsection 2.2 below a set P 
of acceptable (a, b,p, q, r) such that this is fulfilled, with (p, q, r) is in IP. 

3. Identifiability problem: if h(a*,b*,p* 1 q*,r*;x) and h(a,b,p,q,r;x) 
are proportional what are the relations between (a* , b* , p* , q* , r* ) and 
(a,b,p,q,r)l 

For doing so we recall a few classical facts about the hypergeometric function. 
It is clear that if p (or q) is a negative integer, then the power series (TTJ happens 
to be a polynomial. In particular, if p — 0, then 

2 F 1 {0,q;r;x) = l, (8) 
for any value of q and r, whereas if p = —1 then 

2 F 1 (-l,q;r;x) = l-^-x, (9) 
r 

meaning that for q = sr, the function ([9]) is always equal to 1 — sx. Next recall 
that, provided —t £ N, we can express (t) n as T (t + n) /T (t) . Such a formula 
gives meaning to T(t) when t < and t ^ — N. Since actually z i-> l/r(z) is an 
entire function we write 1/T(— n) — when n € N. 

Whenp and q are not negative integers we can apply Stirling's approximation 
to the Gamma function to evaluate the order of the general term of the power 
series ([T]). It turns out that it is equivalent (up to a multiplicative constant) to 
n -i-(r-p-q) _ Thjg implies that its radius of convergence is equal to 1, therefore 
the series is certainly well defined for 16 [0, 1) . It is convergent in x = 1 if and 
only if r — p — q > 0, in which case (see Rainville (1960) page 49): 

xfl I (r — p)i (r — q) 

Note that this implies that this limit is zero if either p-rSNorg-reN. 
F 

2.1 Positivity 

The well known fact that 2Fi(p,q;r;x) — z(x) is an analytic solution of the 
second order differential equation 

x (1 - x) z" + [r ~ (p + q + 1) x]z' - pqz = (11) 
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in the unit disk shows that the zeros of 2-F1 (p, T> r i x ) on (0,1) are simple (for 
if = z(xo) = z'(xo) we easily see by induction using (fTTj) that z^(x a ) = 
for all k G N and thus z = which contradicts z(0) = 1). For this reason 
2-Fi(p, q; r;x)>0 for x G (0, 1) if and only if 2-F1 has no zeros in (0, 1). Denote 

F = {(p,q,r) ; el 3 ; r ^ — N, F(p,q;r;x) > Vx G (0,1)} (12) 

The aim of this subsection is to describe P. 

The determination of the number of zeros of 2-F1 is a difficult question which 
has been investigated in a number of papers, including Klein (1890), Hurwitz 
(1891), Van Vleck (1902). To this aim Klein introduces the function x i-> E{x) 
on the real line defined by E(x) = if x < 1 and E(x) = N if N is the positive 
integer such that N < x < N + 1. Klein finally introduces the non negative 
integer 

X = X(p,g,r)=£;(i(|p-g|-|r-l|-|r-p- ? |+l)) 
and proves the following theorem 

Theorem 2.1. Suppose that r ^ — N. Then the number of zeros Z of 1 4 
-P'CP) <Z; r \ x ) m (0, 1) is cither X or X + 1. 

This theorem leads to the description of P for which we introduce the following 
notation: for x G R the number s(x) G {—1,0,1} is sign 1/T(x) with the 
convention sign(0) = 0. Therefore s(x) = 1 for x > or — 2k — 2 < x < —2k — 1 
with k G N, s(x) = -1 for -2k- 1 < .t < -2fc with fc <G N, s(x) = for x e — N. 
We need also to consider the set 

S = {(x, y, z) g R 3 ; s(x)«(i/)«(y) > 0} (13) 

Therefore the part of 5 1 which is in the octant x, y, z < is a union of unit 
cubes; in the octant x,y < 0, z > the set S 1 is a union of columns with unit 
square section; in the octant x < 0, y, z > the set S is the union of slices of 
unit height; in the octant x, y, z > S is the octant itself. 

Theorem 2.2. Suppose that r ^ — N. Then (p, q, r) G P if and only if 

1. either r>l, r — p — q<0 and p, q > 0; 

2. or r > 1, r — p — q>0 and r — p, r — q > 0; 

3. or r < 1, j — p — q < 0, r — p,r — q < 1 and (p, g, r) is in S 1 ; 

4. or r < 1, r — p — q > 0, p, q < 1 and {r — p,r — q, r) is in 

Remark. The most obvious case of positivity is p,q,r > 0. It can be checked 
that it fulfills the sufficient conditions above to be in P. The same holds for 
r — p, r — q, r > 0. 

Proof. From Klein's Theorem 2.1 a necessary condition for (p,q,r) G P is 
X(p, q, r) — 0. We now consider three cases. 
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The case r > l.<=: Trivially if p, q > and r > 1 we have F(p, q; r; x) > for all 
x £ (0, 1). If r— p, r — q > Euler formula © provides the same result. =>: From 
Theorem 2.1 we have X(p, q, r) — , namely g(|p— q\ — \r— 1| — \i — p— q| + l)) < 1 
or 

\p-q\<r+\r-p-q\ (14) 

Now we can assume p > q. If r — p — q > (j 14[) implies r — p,r — q > 0. If 
r — p — q < (H"4"|) implies p, q > 0. 

The case r < 1 and r - p - q > 0.<=: If p, q < 1 this implies X = since 
X(p, q, r) = <S> i(|p - g| - |r - 1| - |r - p - q\ + 1)) = max(p, q) < 1. (15) 

If r — p — q > then from (fl"0| the limit of .F(p, q; r; x) as x tends to 1 is 

r(r-p)r"(r-g) If r ~ P and r ~ 1 aTe not in ~N this number r[r-p)r(r-g) is > 
since (r — p, r — q, r) is in S. Therefore Z cannot be X + 1 = 1. 

If r — p or r — q is in — N, things are more complicated since in this case 
F(p,q;r; 1) = Oby ([ID]). We introduce Q(x) = x 1_r 2 -Fi(p+l-r, fl+l-r; 2-r;x) 
which is an other solution of (|lll) which is linearly independent of 2-F\(p, 9; x) 
Applying the part r > 1 of this theorem to the function 2-Fi(p + l — r, q + 1 — 
r; 2 — r; x) shows that Q(x) > on (0,1) (for this, observe that X(p,q,r) — 
X(p + 1 - r,q + 1 - r, 2 - r) and thus X(p + l-r,g+l-r,2-r) = 0). 
Now assume that Z = 1, denote by xo the zero of 2Fi(p,q;r;x) in (0,1) and 
apply the Sturm Liouville theorem to the interval (xo,l) and to the pair of 
solutions 2-Fi(P) 9; r; x), Q{x) of (|11[) . Thus Q should have a zero in (xo,l), 
a contradiction. Therefore Z = X = 0. The remaining case r — p — q = is 
obtained by passing to the limit. 

=>: Since (p, q, r) € P we have X(p, q,r) — and p and q < 1 from (fl"5j) . If 
r — p — q > we use the fact that the limit in 1 of 2^1 (p, 9! r; x) as x tends 
to 1 is rlr^p^rXr-l?) ana - * s — ^- This implies (r — p, r — q, r) is in 5. Again, the 
remaining case r — p — q = is obtained by passing to the limit. 

The case r < 1 and r — p — q < 0. This case is reduced to the preceding one 
by the Euler formula ©• □ 



2.2 Integrability 



Proposition 2.3. Let (a,b,p, q,r) G K 5 such that r ^ — N. Assume that 
/i(a, 6,p, q, r; x) > on (0, 1), that is (p, q, r) € P defined by (TT2"1) . The condition 

a, 6, r + 6-p-q>0 (16) 

is sufficient for having / = h(a,b,p,q,r;x)dx finite. Under these circum- 
stances 

^ ™ £ !™t - *h ») .« «. * r, . + M) ■ (IV, 

where B (a, b) = pgjpj^ , whereas the second factor 3F2 is a generalized hyper- 
geometric function, evaluated at x = 1 . Conversely if 



6 



1. either r — p — q> 0, r — p ^ — N and r — q ^ — N 

2. orr — p— q<0, p $ — N and g ^ -N 
(HHJ) is also necessary to have / finite. 

1 /2 

Proof. If a > trivially J /lefcc is finite. If r — p — g > then 2^1 (p, 9; 
has a finite limit in 1 and b > implies that , 2 /ida; is finite. If r — p — q < Oby 
pT|) and the fact that r + i)-p-g > we similarly get that J^ 2 hdx is finite. If 
r — p — q = note that the general term of the defining scries for h(a, b,p, q, r; x) 
has a constant sign. Therefore monotone convergence is applicable and the sum 
is B(a,b) 3F2 (p, q,a;p + q, a + b;l) . But the series 3F2 (p, q, a;p + q, a + b; 1) 
converges because p+q+a+b— p — q — a = b > 0. This reasoning shows also 
that (fT7| holds. Therefore the direct part is proved. 

Conversely if r~p—q > if none of r— p and r — q is in — N we can claim that 
the limit of 2^i(p, <7! f", x) for x — > 1 exists and is not zero. Thus if furthermore 
J Q h(a, b,p, q, r; x)dx is finite clearly a and b are positive, and r + b — p — q > 
a fortiori. The case r — p — q < Ois similar by the Euler formula For 
r — p — q = we use the fact that if neither p nor q are in — N we have 

2 F 1 (p,q;p + q;x)~ x ^ 1 1 log — i— . (18) 
B(p,q) l-x 

To see this apply a classical result about the asymptotic behavior of a power 
series near the circle of convergence (see Titchmarsh (1939, 7.5)) to 

_ T(p + n)T(q + n) _ 1 
L(P + q + n) n 

Note that if (p, q,p + q) is in P defined by fj 12[) Theorem 2.2 implies that for 
p + q > 1 then p and q are positive and for p + q < 1 we must have (p, q,p + q) 
in S; thus B(p, q) > even when p or q are negative. Clearly (|T5|) implies that 
h is integrable only if a, b > 0. □ 

For the sake of completness in the following proposition we consider the cases 
where conditions 1 and 2 of the previous proposition are violated. 

Proposition 2.4. Let (a,b,p, q,r) e R 5 such that r ^ — N. Assume that 
h = h(a, &,p, <7, r; x) > on (0, 1) or equivalently that (p, q, r) is in P. 

1. If 7' — p — q > and r — p or r — q is — n with neN then Jq 1 /wAe < 00 if 
and only ii a, r + b — p — q > (but 6 < is allowed) 

2. If r — p — q < and p or q is — n with n € N then Jq 1 /idx < 00 if and only 
if a, b > (but r + &— p — g<0is allowed) 



Proof. Parts 1 and 2 are obvious. To show Part 4, observe that if r — p — q < 
and ft > on (0, 1), Theorem 2.2 implies r < 1 since p = — n (say) is negative. 
The inequalities of Theorem 2.2 give r—p = r+n < 1 and r—p—q = r+n—q < 0. 
For simplicity we write s = r + n < 1. This implies s < q. Since r = 1 — n is 
forbidden by r ^ — N, we have s < 1. Now 



lFl { - n ' q; r; 1} = (i^y; g if i^f = " ( i- s) r > °- 



(q)k (1 - s)n-fc _ (q - s + l) n 
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Thus Jq 1 hdx < oo if and only if a, b > 0. Part 3 is proved from part 4 by using 
the Euler equality ([6]). 

Remark. From the two previous results the cases where J* hdx < oo does not 
imply min(6, r + b — p — q) > are truly exceptional. An example is obtained 
from Proposition 2.4 part 1, with r = p and q = —2; in this case J hdx < oo if 
and only if a, b + 2 > 0. Thus for instance b = — 1 is allowed. For this reason we 
coin the following definition, using ([5]): 



Definition. The beta hyperbolic distribution 

x^il-xf- 1 
B(a,b) 3 F 2 {a,p,q;r;a + b;l) 



BH(a,b,p,q,r)(dx) = ——l i0 . 1) (x)dx 

B(a,b) H F 2 {a,p,q:r:a + b:l) 



is defined if and only if 

1. r g — N; 

2. (p, q, r) is in P described in Theorem 2.2; 

3. a > 0, b > and r + b- p- q>0. 

In the sequel we denote by P the set of (a, b,p, q, r) satisfying the three condi- 
tions above. 

Remark. It is immediately verified that the set P is invariant under both T and 
S. Recalling the remark following Theorem 2.2 it is also immediately obtained 
that the posititity of all the components of the vector v = (a, b,p, q, r) T (or Sv) 
together with the condition r + 6 — p — q > (or b > 0) ensure that ueP. 

2.3 Identifiability 

This subsection adresses to the problem of the identifiability, since we already 
know that 

BH(a, b,p, q, r) = BH(a, b, q,p, r) = 

BH(a, r + b — p — q,r — p,r — q,r) = BH(a, r + b — p — q,r — q,r — p,r) 

from the obvious symmetry in (p, q) and from Euler identity. Therefore BH 
distributions could have four different representations. On the other hand, for- 
mulas © and © show that the number of representations of the same BH 
distribution can even be infinite. We are interested in deciding when a function 
h(a,b,p,q,r;x) defined by (0) can be represented with different values of the 
parameters o, b,p, q and r. The following result says essentially that, aside from 
the symmetry of 2-F1 in p and q and the Euler's identity (JBJ), the only lack of a 
unique representation is due to the relations © and ©. The theorem does not 
use the results of Sections 2.1 and 2.2. 

Theorem 2.5. Suppose that 

h(a* , b*,p*,q*,r*;x) — Ch(a, b,p, q, r;x),0 < x < 1. 
Then C = 1, a* = a and 
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a Either r* = r, and 

1. either b* = b,{p*,q*} = {p,q}, 

2. or b* = b + r — p — q, {p* , q*} = {r — p,r — q}; 

b Or the function has the form 

h(a,b,p,q,r;x) = x^ 1 {I - x) u ~ X {I - vx) . (19) 

1. If v = either one between p and q is zero and u = b, or one between 
p and q is equal to r, say p, and u = b — q (the same obviously holds 
for b*,p*,q* and r*); 

2. If v ^ 0, 1 (which can be assumed without lack of generality) either 
one between p and q is equal to —1, say p, u — b and v — -, or one 
between p and g is equal to r + 1 , say p, u = b — q — 1 and v = (the 
same obviously holds for b*,p* ,q* and r*). 

Remarks. We see that, when v = 0, and a, it > 0, the right hand side of 
(IT9l) yields the (unnormalized) density of the beta law j3 ajU = BH(a,u,p,0,r) 
where p and r are arbitrary. By a proper choice of p and r we can ensure that 
(a, u,p, 0, r) € P. This shows that beta distributions are BH distributions in the 
sense of Remark 2 in Section 2.2. 

Next notice that if the right hand side of ([TO]) is positive in the whole interval 
(0, 1) it has to be < v < 1; if moreover a, b > then h is integrable and can 
be normalized to become the density of BH(a, u, — 1, rv, r) where r is arbitrary. 
By taking r large enough we can ensure (a, u, —1, rv, r) € P. The corresponding 
law will be called quasibeta and indicated by qfia,u,v Thus beta and quasi-beta 
distributions are beta hypergeometric according to the definition given at the 
end of Section 2.2. 

Proof. By assumption for x € (0, 1) 

(l _ xf- 1 2Fl (p, q; r; x) = Cx^- 1 (1 - xf' 1 2 F x (p* ,q*;r* ; x). 

Dividing the r.h.s. by x a_1 and going to the limit as x I we get that the 
l.h.s. converges to 1. Then the same has to hold for the r.h.s., which implies 
necessarily that a* = a, and C — 1. We thus get 

(1 - x) d 2 F 1 (p,q;r;x) = 2 Fi(p*,q*;r*;x) 

where d — b — b* . Recall that 2 Fi(p,q;r;x) — z[x) is a solution of the second 
order differential equation (1111) . In the sequel we fix the three numbers p,q,r 
and for any real number d we define the linear differential operator L ( i (y) (x) 
defined by 

/, s // , r ,mj -n i /_, dr-pq+[d(d-p-q)+ pq]x 

x(l-x)y + [r+(2d — p — q—1) x\y H y. (20) 

1 — x 

We now show that y(x) = (1 — x) d 2 F 1 (p, q; r; x) satisfies Ld (y) [x) — 0. To see 
this we write 

z = {1- X y d y, z' = d(l -x)- d ~ 1 y+ (1 -x)- d y' 
z" = d(d+l)(l-x)- d - 2 y + 2d(l-xy d ~ 1 y' + (l-x)- d y" 
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and we carry the results in (|TT|) . On the other hand y(x) — 2 Fi(p*, g*; r*; a;), 
hence 

x (1 - x) y" + [r* - (p* + q* + 1) x]y' - p*q*y = 
which subtracted from Ld(y) — yields 

r * , , , , * , * \ 1 / , dr + P*1* - PI + i d ( d - P - g) + W ~ g*g*]g n 

1 — a; 

which by setting 

>1 = d(d — p - q) + pq — p*q* , B = dr + p*q* — pq, 
C = 2d + p* +q* -p- q, D = r - r* 

is rewritten as 

(Cx + D)y' + ^±^y = 

Let us now suppose that C = D = but A and B are not both zero. Then 
y = 0, which is impossible. Now assume C = but D ^ 0. Thus 

, Ax + B 
D (1 — x) 

and thus 

y = (l-a;):jFiO,g;r;a0 = e* x (l-z)^- ■ 

Define di = d — B 7 ) A and y\ — e^ x . Then j/i satisfies L& x (yi) = (using 

notation of PO")) ). Since (yi) (a;) = where P is a polynomial 

of degree 3 with leading coefficient (j)) 2 this implies that A = 0. Therefore 

y = (1 — x) D and we are in the second case. Next let C =^ and D = 0, in 
which case 

y' _ Ax + B _d C 2 
y Cx (1 — x) x 1 — x 

and therefore y = Czx Cl (1 — x) C2 . Since y (0) = 1, this implies that C\ — so 
we are in the second case again. If C ^ and D ^ 0, then we can write 

, ax + j3 

V = (l + cx)(l-x) V} 

where c = ^ ^ 0. We now distinguish between the cases c ^ -1 and c = — 1. 
If -1 then 

y' cCi C 2 



y 1 + ca; 1 — a; 

from which y = (1 + cx) Cl (1 — a;) ' 2 . Define d 2 = d—Ci and y 2 = (1 + ca;) Cl . 
Then y 2 satisfies Ld 2 (y 2 ) = 0. This implies that C\ {C\ + 1) = 0, therefore 
either C\ = (thus we are in case b)) or C\ = — 1 (thus we are in case c)). Next 
suppose that c = — 1 . Then 

y' aa; + /3 Ci C 2 



(1-.t) 2 1-a (1-x) 2 
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from which y = (1 — x) 1 exp < -f^—, >. Define d% = d — C\ and j/3 = exp 




Then y 3 satisfies Ld 3 (2/3) = (using notation of ([2"0")0 . Since L& 3 (y 3 ) (x) — 
(l-a) 3 ^ 3 ( a ')' wnere is a polynomial such that P(l) — Cf , which implies that 
C2 = 0, which falls into case b). Finally suppose that A = B = C = D = 0. 
From D = we get r* = r. Furthermore 

A = B d{r + d-p-q) = 0. 

Thus either d = 0: as a consequence 6 = 6*, moreover p + g = p* + g* (from C = 
0) and pq = p*q* (from A = 0), which means {p, q] — {p* , g*} . Or d = p + q — r: 
as a consequence b* = b + r— p — q, moreover 2r — p — q = p* + q* (from C = 0) 
and (r — p) (r — g) = p*g* (from A = 0), which means {r — p, r — </} = {p*, g*}. 
Thus the condition A = B = C = D = yields case a) of the Theorem. To 
complete part b) if f v (x) = x a ~ Y (1 — x) c ~ we have 

(1 - xf~ c 2 F 1 (p 1 q;r;x) = 1. 

Thus define d± = b — c and 2/4 = 1. Then 2/4 satisfies Ld 4 (2/4) = , which 
implies d^r — pq — and d^ (c?4 — p — q) + pg = 0. Summing the two equalities 
we get c?4 (c?4 + r — p — q) = 0. Thus either c?4 = (and c = b) and pg = 0; or 
<f 4 + r — p — 5 = (and c = 6 + ?* — p — g) and (r — p) (r — q) = 0. Finally, to 
complete part c), if /„ (x) — x a_1 (1 — x) 1-1 (1 + cx), then 

(1 - x) bl (1 + cx) = (1 - x) b 2 F X (p, g; r; z) . 

Define c?5 = 6—61 and y 5 (x) = 1 + cx. Then y 5 satisfes Ld 5 (2/5) = 0. This 
means 

f . ,„ , n s. d 5 r - pq + (d 5 (d b - p - q) + pq)x 

c(r + (2d 5 - p — o — 1) as) H (1 + cx) = 0. 

1 — a; 

Since c 7^ — 1,0, this implies d$ (d.5 + r — p — q) = 0. As a consequence the 
fractional term is the constant d$r — pq, so by equating the coefficients of the 
polynomial at the l.h.s. to zero we get the two equations 

f c(2 + r)d 5 = (p+l)(<7 + l) 
\ (c + d 5 )r=pq 

Thus either d§ = 0, in which case b\ = 6, and (p + 1) (q + 1) = and cr = pq. 
Thus if, say, p — —1, we get q = —cr as stated in the theorem. Or = p + q — r, 
in which case b\ = b + r — p — q, and we get similarly (r — p + 1) (r — q + 1) = 
and cr = (r — p) (r — q). Thus if, say, r — p = — 1, we get r — q = —cr as stated 
in the theorem. □ 



2.4 Examples 

To illustrate the above results, let us recall a few classical equalities (see Abramovitz 
and Stegun (1965) pages 556-7) 

2 F x (p, ~P; h sm 2 9) = cos2 P 0, ^(p, 1 - p; I; sin 2 0) = C ° S(2p " 1)g 
2 2 cos y 
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3 sin(2p-l)0 , 3 9 ^ s'm2p9 

2 F X 1 - p; -■ sin 2 = - V P 2 F 1 (1 +p,l-p\ p sin 2 - — V - 

2 (2p — l)sm0 2 p sin 20 



2^1 (p, l+2p; sin 2 6») = -g—, 2 Fi(p, \+P] 2p; sin 2 9) ' 



2 " " ' (cosf) 2 ?' v ^'2 y ' y ' ' cos^cosf) 2 ?- 1 
aJ Fk(I ,I ; |; S in 2 0)= * 2 F 1 (l,l;|;sin 2 0)= 2 " 



l 2' 2' 2' y sin<2' iV ' ' 2' ' sin26> 

,11. 1 1 

2Fi{p, - + p; x) = 



2 y, 2' ' 2(l + Vi) 2 P 2(1-v^) 2 p' 

1 3 1 1 1 

afi(p, 2+^2^)- 2 (l-2p)V^ ( (l-V^) 1 " 2p ~ (1 + v 7 ^) 1 ^ 

,13, 1 1 + Jx , ,11 
2^(1,-; -;*) = -^log- 2 ^(1, 1; 2; i) = -log- 



2'2' ' 2Vi to l-Vi' -*^-» x b l-a;' 

Many of these identities describe a hypergeometric function in terms of x = 
sin 2 0: this is sometimes useful for describing a distribution on (0,tt/2): the 
image of BH(a,b,p 7 q,r) by x H> = arcsin-yx from (0,1) to (0,tt/2) has a 
density proportional to sin 2a_1 9 cos 26-1 2 Fi(p, q\ r; sin 2 9). 

3 Proof of the basic identity. The 9 parameter- 
ization 

Mellin-LIKE TRANSFORMS of BH(v). Assume now that v = (a,b,p,q,r) T is 
in P. Recall that this set P has been defined at the very end on Section 2.2 and 
is 

P = {(a,b,p,q,r); {p,q,r) G P, r g" -N, a, b, r + b -p - q > 0}. (21) 

(the set P is described in the statement of Theorem 2.2). Let us fix (s,t) such 
that (a + s, b + t,p, q, r) is in P or equivalently such that s > —a and 

t > —6 — min{0, r — p — q}. 

We get from ((T7]) the important formula 

f 1 X s (1 - xf BH(v)(dx) = B ( a + s ' b + t ) 3F 2 {p,q,a + s;r,a + b + s + t;l) 
Jo B(a,b) 3 F 2 (p,q,a;r,a + b;l) 

(22) 

The Thomae formula. There is a fundamental relation between the Eulcr's 
gamma function T and the generalized hypergeometric function 3 F 2 defined by 
([5]) and evaluated at 1 = 1. This relation has been originally obtained by 
Thomae (1879). In the paper by Maier (2005) and Beyer et al (1987) such a 
relation is reformulated as the invariance of a suitably defined function with 
respect to the symmetric group 1S5. 
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Lemma 3.1. (Thomae's formula) The function 

T( A BCD F) — T{C) ^(A,B,C;D,E;1) 
T(A,B,C,D,E) r(D)T(E) 

has the invariance property T(A, B, C, D, E) = 

T(D-C,E-C,D+E-A-B-C,D + E-A-C,D + E-B-C) 



Proof. See Bailey (1935), Askey, Andrews and Roy (2000) for two different 
proofs and Asci et al (2008) for a probabilistic one based on the following idea: 
if U and V are two arbitrary beta random variables on (0,1) and if t is real, 
compute E((l — UV) 1 ) in two ways: expansion in a series of powers of UV or 
computation of the density of UV by multiplicative convolution. We obtain in 
this way an identity involving the parameter t and the 4 parameters of the beta 
distributions. This identity is equivalent to the Thomae's formula. □ 

It should be emphasized that Thomae's formula is an equality between analytic 
functions in their whole domain of analyticity, so certainly holds true when all 
the arguments A,B,C : D — C,E — C and D + E — A — B — C are positive. 



Proof of the basic identity (UJ). 



Theorem 3.2 Let v 

r — a > 0. Consider 



(a, b,p, q, r) T be in P and assume a + b — p > and 





' -1 











1 




1 


1 


-1 








M = 





1 


-1 


-1 


1 







1 
















1 





-1 


1 



, n 



i i 

-1 o 



-10 

o o 1 



(23) 



Then Mv is in P and if X ~ BH(v) and W ~ ^ we have T 



+xw 



BH(Mv) 



Proof: Denote Mv = (a', b',p', q', r') T . The facts that v is in P, b' = a+b-p > 
and a 1 = r — a > imply that p' = r + b — p — q > and q' = b > 0. It is 
more delicate to prove that r' — r + b — q > 0. This certainly holds when q < 
and, from r + 6 — p — <7>0, it holds when p > as well. Next we assume both 
p < and q > 0. Since (p, q,r) € P we make the following deductions. Case 
1 is impossible. In Case 2 from r — q > we get r + b — q > 0. In Case 3 we 
have (p, q, r) 6 S and since q > and r > a > it has to be 1/T(p) > which 
is fulfilled with p < only when p = —n, n being a positive integer, which 
is excluded since it would imply r — p > n, a contradiction with r — p < 1. 
Finally, in Case 4 we have 0<r<l, 0<g<l and 1/7(7' — q) > 0. Thus 
either r — q > 0, which implies r + b — q > 0, or —1 < r — q < 0. The only 
possibility is thus r — q, in which r + b — q > as well. ). Finally, the inequality 
r' + b' — p' — q' = a > together with the previous ones implies Mv € P (see 
the remark at the end of Section 2.2). 

Now denote Y = 1+ x W ■ This implies = XW and therefore for t e 
(-6, a) we can write E((l - YfY' 1 ) = E(X t )E(W t ). Because in the following 



13 



calculation the constants are quite long to write, let us adopt the following con- 
vention: we say that two positive functions / and g of t € (—6, a) are equivalent 
if t i-> f(t)/g(t) is a constant with respect to t. This fact is denoted / = g 
or -with some abuse of notation- f(t) = g(t). With this convention we get by 
replacing (s,t) in ([22]) by (*,0) 

E(**) = i^ZTX^ 3^2(p,9,o + t;r,a + 6 + t;l) 
1 (a + o + t) 

as well as 

E(W*) = r(a + & - p + t)r(r - a - 1). 

This implies 



Effi - yi*y-*i = r ( Q + & - p + *)r(r - g - t)r(a + 1) 

U j ; r(a + 6 + t) 1 ; 

x 3-P2 (P) 9, a + i; r, a + b + 1; 1) 



From formula (j22[) note that 
1.1 



y* (l-y)- 1 BH(Mv)(dx) = T(a' + t)T(b' - t) 







x 3J F 1 2(p',g',a' + t;r',a' + 6';l) 
= r(r-a + i)r(a + &-p-i) (25) 
x 3^2(r + & — p — g, b, r — a + t;r + b - q, r + 6 — p; 1) 

The knowledge of the function i H> E((l — K)*y _t ) gives the knowledge of the 
distribution of (1 — Y)/Y and of the distribution of Y. Therefore enough is to 
show that the right hand sides of[24]and[25]are equivalent. To see this we simply 
apply the Thomae formulae (Lemma 3.1) to 

A = p, B = q, C = a + t, D = a + b + t,E = r 

and this concludes the proof of Theorem 3.2. □ 

The 9 parameterization. Up to now, the BH distributions have been pa- 
rameterized by (a,b,p,q,r) belonging to the subset P of W 5 described in (|2"Tj) . 
One defect of this parameterization is the fact that 

BH(a, 6,p, q, r) — B(a, r + b — p — q,r — p,r — q,r) 

(as implied by the Euler formula) is not apparent. A second defect of the 
parameterization (a, b,p, q, r) is that it makes complicated the application of the 
basic identity. For these reasons we choose to make a linear transformation of 
a, b,p, q, r as follows. Introduce a 5-tuple of parameters 9 = (G\, 62, #3, 64, 65) € 
R 5 . For notational convenience, we set 

a = 6» 4 + 6» 5 , 6 = 6*1+6*4, p = 9 3 + 9 5 , q = 9 3 + 6> 4 , 

r = 9 2 + 9 3 + 9 4 + 9 5 (26) 
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This can be inverted as 

26»i = a + 2b - p - q, 29 2 = -a - p - q + 2r, 

28 3 = -a + p + q 28 A = a-p + q, 28r = a + p-q (27) 

From now on we denote BHg = BH(a, b,p, q, r) where (a, b,p, q, r) is given by 
((26)) . For example important particular cases like BH(1, 1, 1, 1, 2) considered in 
and BH(2, 2, 2, 2, 4) are rewritten as 

BHi i i i ddx) = 4- log —^l(o,i) (28) 

2>2>2>2>2 7T X 1 — X 



BH lxlxl {dx) 



10-7T 2 



x , 1 2 

- log — 



l ( o,i) (a:)dx. (29) 



We say that -B-ffg exists when (a,b,p,q,r) £ P. Necessary and sufficient condi- 
tions for this are given in Proposition 3.4 below. This new parameterization has 
many advantages. We see immediately that exchange of p with q is equivalent 
to the exchange of 9 4 with #5, whereas, since 

r + b- P -q = e 1 + e 2 , r- P = e 2 + e i , r -q = e 2 + o 5 

Euler's identity corresponds to exchange 8 2 with 6*3 and 6*4 with 65. Therefore 
actually the distribution 

-B-ffsi, 2 , ea.^.es = BHe u {9 2 ,e 3 },{e it e B } (30) 

being symmetric in (9 2 , 63) and (#4, #5) has rather to be considered as depending 
on 61 and on the two sets {9 2 , 83}, {64, 65}. This notation BHg u ^g ai g B -y^g it g B y is 
however a slight abuse of language since the set {6 2 , 63} could be reduced at one 
point if 6*2 = 63 and the language of multisets (sets with entire positive weights) 
could be more adapted. Up to this, we consider that the notations (|3U)1 are 
sufficiently informative. The revisited basic identity can be reformulated in this 
new notation as follows: 

Theorem 3.3. Let X ~ BH eu{ g M{eit8s} , W ~ ^+e 5 ,9 2 +e 3 and W ' ~ 
(2) 

^0i+04 02+0 5 sucn tna t \Wi ") are independent of X. Then 

BHff^ff), _Q A \.f()„_() a \, TTTTTT ^ ^^0 4 ,{0i ,0 5 } ,{0 2 ,03} ■ 



l + WX "B^i^m^a/j i + W'X 



Theorem 3.3 shows again that there are two ways to apply the basic identity. 
It shows also that the matrix M appearing in (|23[) is similar to a permutation 
matrix of order 5. 

The existence of BH e . In order to check whether BHg 1) ^g 2t g 3 y^g 4t g 5 y does 
exist we dissymetrize {#2, #3} and {94, 9 5 } by assuming 6 2 < 63 and 64 < 9 5 . 
Recall that the subset S of R 3 has been defined in ([T3"| and is the set of (x, y, z) 
such that l/r(a;)r(y)r(z) > 0. The condition (a, b,p, q,r) £ P where P is given 
by (|21l) gives the following 

Proposition 3.4. The distribution BH du fg 2i g 3 \ig /Li g ! .\, where 8 2 < 63 and 
64 < #5 exists if and only if 
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• 6»i + 2 > and 4 + 5 > and 

• either r = 2 + 3 + 4 + 5 > 1 and 3 + 4 > 

• or r = 2 + 3 + 4 + 5 < 1 , 2 + 5 < 1 and (0 3 + 4 , 3 + ft. r) € S. 

Definition. We will call G the set of parameters 01,02,03,04,05 € R 5 which 
is the image of P by the linear map (a,b,p,q,r) M- (0i,...,0s) described by 
(|2"T1) . The part of the set such that 02 < 03 and 4 < 05 is also described by 
Proposition 3.4. 

Remark. From (|!26"|) one can observe that a,b,p, q,r,r + b — p — q > is 
equivalent to 

Ol + 02, Ol + 4 , 3 + 04, 04 + 05,02 + 3 + #4 + #5 > 0. (31) 

Thus the vectors satisfying the inequalities ([31]) belong necessarily to 6. This 
certainly happens if is such that 0; + 0j > for all 1 < i < j < 5 except 
for — (2,4) (here 02 < 03 and 04 < 05). Thus in this case BHg exists 

(this observation will turn out to be useful for Theorem 5.1 below). Moreover, 
recalling the first part of the proof of Theorem 3.2 we see that the application 
of the basic identity always yields beta hypergeometric distributions with the 
vector of parameters satisfying the inequalities (|3T|) . 

4 Random continued fractions with a beta hy- 
pergeometric distribution 

It is clear the the iteration of the random transformations appearing in Theorem 
3.3, applied to X ~ BHg with = (0i, 2 , 3 , 04, 05) € 6 yield random variables 
whose distribution is of the form BHgi where 0' = (0^,02,03,04,05) £ 9 is 
obtained from a permutation of the components of 0. 

For this reason, for any G R 5 we define the finite subset Vg C Kg of vectors 
0* which can be obtained in this way. Motivated by Theorem 3.3, we are going 
to define a directed graph structure on Vg. The possible forms of these graphs 
will be quite limited. 

4.1 The graphs Gg and their subgraphs 

The role of the seven partitions of 5. There are seven partitions of 5 
enumerated in ([7]). To each point £ K 5 we attach the discrete measure on K 
which is J2j=i = Efc=i m kS Xk where {x\, . . . , x n } is the image of j 0j and 
where nik is the positive integer which is the number of j — 1, . . . , 5 such that 
0j = Xk- Thus n < 5 and mi + . . . + m n = 5 defines the partition of 5 induced 
by G R . For convenience in the sequel we take ui\ > mi > . . . > m n and we 
write 

x = xi, y = x 2 , z = X3, u — X4, v = x 5 

when these Xk do exist. Suppose for instance that the partition attached to 
is 3+2. Therefore we shall use three times the letter x and two times the letter 
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y; the 5 elements of Vg will be 



(x,{x,x},{y,y}), (y,{x,y},{x,x}), (x, {y, y}, {x, x}), 

(y,{ x > x }A x iy})> ( x ,{ x ,y}A x iv}) 

that we quickly code 

x\x 2 \y 2 , y\xy\x 2 , x\y 2 \x 2 , y\x 2 \xy, x\xy\xy (32) 

The directed graph Gg. According to Theorem 3.3, given 6 € K 5 , if 6 e 
(which means that BHg exists) and if 61+65, 62 + 63 and 9i + 64 are pos- 
itive we can move to two new beta hypergeometric distributions. Ignore for 
a while the constraints linked to inequalities. Let us extend the process to 
the whole 9's in R 5 or rather to the elements of the quotient described by 
(#i, {62, 63}, {64, 6 5 }) : from this element we move either to (6 5 , {61, 64}, {62, 63}) 
or to (64, {9i, 65}, {62,63}). These two elements may not be distinct. 
We need to introduce in (|3"2"|) the arrows 

x\x 2 \y 2 y\xy\x 2 

y\xy\x 2 x\xy\xy 

x\y 2 \x 2 — > x\x 2 \y 2 

y\x 2 \xy x\y 2 \x 2 , y\xy\x 2 

x\xy\xy y\x 2 \xy, x\xy\xy 

getting the following directed graph, called the graph Gg: 



x\x 2 \y 2 



x\y 2 \x 2 ■ 



' y\xy\x* 



■ y\ x \xy 



(33) 




It is clear that the graph Gg is the same for all 6 £ M 5 sharing the same 
partition of 5 given by the numbers mi > rri2 > . . . m n . Thus we explore the 7 
possible forms of this graph. We are particularly interested in determining the 
cycles in these graphs. A cycle of order k > 2 in a directed graph is a sequence 
vq, ■ ■ ■ ,Vk-i of distinct vertices such that (vk-i,vo) and (1^,1^+1) are oriented 
edges of the graph for i = 0, . . , ,k — 2. A cycle of order 1 is a vertex v such that 
(v, v) is an edge). 

Description of the seven graphs Gg 

• The CASE 5. Here 6 — (x,x,x,x,x) and the graph Gg is rather trivial 
with one point and one cycle of order 1 

n 

x\x 2 \x 2 (34) 
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The case 4+1. Here the graph has three vertices and is 

x\x 2 \xy •* *- x\xy\x 2 



(35) 



y\x 2 \x 2 

It has two cycles of order 2 and 3. 

The case 3+2. The graph has already been drawn in (1331) . It has three 
cycles of order 1, 3 and 5. 

The case 3+1 + 1. Here the graph has eight vertices and is 



x\xz\xy 




z\x 2 \xy —>• x\yz\x 2 ■<— y\x 2 \xz 

\ \ \ 

y\xz\x 2 -<— x\x 2 \yz z\xy\x 2 




x\xz\xy 



There are one cycle of order 2, two of orders 3, 5 and 6. 
The case 2+2+1. Here the graph has 11 vertices and is 

z\y 2 \x 2 



x\xz\y 



x\y 2 \xz ■<— y\xz\xy 



x\yz\xy — >■ y\x 2 \yz 




y\yz\ x 



' z\x 2 \y 



(36) 



There are two cycles of order 3, one of order 4, six of order 5, four of order 
6, two of order 7 and 9 and one of order 8. 

The case 2+1 + 1 + 1. The 18 vertices are 



A 


— x\xu\yz 


A' 


= x\xy\uz 


A" 


— x\xz\uy 


B 


— x\yz\xu 


B' 


= x\uz\xy 


B" 


= x\uy\xz 


C 


— u\x 2 \yz 


a 


= y\x 2 \yz 


C" 


= z\x 2 \uy 


D 


= u\yz\x 2 


D' 


= y\uz\x 2 


D" 


= z\uy\x 2 


Ei 


— u\xz\xy 


E'i 


= y\ux\xz 


E'( 


= z\xy\ux 


E 2 


= u\xy\xz 


E' 2 


= y\xz\ux 


E'I 


— z\ux\xy 



18 



Here is the graph 




There are two cycles of order 3, twelve of order 5, nine of order 6, three of 
order 7, nine of order 8, eight of order 9, three of order 10, three of order 
12, six of order 13, six of order 14, two of order 15, nine of order 16. 

• The case 1 + 1 + 1 + 1 + 1. The graph has 30 vertices and is too compli- 
cated to be drawn here. The two edges issued from u\vx\yz are given by 
u\vx\yz — > y\uz\vx, z\uy\vx. There are exactly two incoming edges, com- 
ing from x\yz\uv and v\yz\ux. There is a large number of cycles in this 
graph; the following remark helps in their determination. 

A REMARK ABOUT THE AUTOMORPHISMS OF THE GRAPHS AND THEIR CY- 
CLES. The graphs 3+1+1, 2+2+1, 2+1+1+1+1 and 1+1+1+1+1 have auto- 
morphisms induced by the permutations of the letters. For instance, the vertices 
of the graph 2+1+1+1 are coded by letters x 2 yzu and the 6 permutations of 
yzu induce a group G of automorphisms of the graph. Clearly, G transforms 
a cycle of size k into a cycle of size k. Therefore the set of cycles of size k is 
splitted in orbits. For the simpler graphs 3 + 1 + 1, 2 + 2 + 1, and 2 + 1 + 1 + 1 the 
number of orbits can be easily found by hand. We indicate below the number 
of orbits of size k for the graph 1+1+1+1+1, which have been determined by 
computer. We have not displayed the sometimes quite large number of cycles 
of each order as we did for the six others. We get the following results 

3 + 1 + 1 : 2(1), 3(1), 5(1), 6(1) 
2 + 2 + 1 : 3(1), 4(1), 5(3), 6(2), 7(1), 8(1), 9(1) 
2 + 1 + 1 + 1 : 3(1), 5(2), 6(2), 7(1), 8(2), 9(2), 

10(1), 12(1), 13(1), 14(1), 15(1), 16(3) 
1 + 1 + 1 + 1 + 1 : 5(1), 6(1), 8(1), 9(1), 12(2), 13(1), 

14(3), 15(4), 16(7), 17(3), 18(4), 19(8), 
20(7), 22(7), 23(10), 24(2), 26(15), 30(4) 

To understand this array, 30(4) on the last line means that the graph 1+1+1+1+1 
has 4 different orbits on the set of cycles of order 30 (the existence of cycles of 
order 30 implies that the graph is Hamiltonian). 
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The two subgraphs G* e and G* e * of Gg 

Consider such a graph G — Gg that we have just defined. Denote 

vg = {02,0s}, i M) 



(38) 



and write vq = vg. If (vo, v ) is arL edge of the graph recall that either v — v\ — 
(05, {0i,04>, {02, 3 }) or« = « 2 = (04,{0i,0 5 },{02,0 3 })- We say that that the 
edge (vq,v) is acceptable if 02 + 03 > and either 0i + 05 > when u = vi or 
0i + 04 > when v = ■ We denote by Gg the subgraph of G when we remove 
the non acceptable edges. Finally, in the graph Gg let us remove the vertices 
v such that BH V does not exist. We also remove the edges of Gg which are 
adjacent to these erased vertices. The remaining graph is denoted by Gg*. A 
detailed example is in order: we start from = z\y 2 \x 2 where x, y, z are distinct 
real numbers. Therefore Gg is of the 2+2+1 type and it is the graph (131)1) . We 
now assume x + y > 0,y + z > 0, y < 0. This leads to the following graph Gg 



z\y 2 \x 2 

x\xz\y 2 — >- y\xy\xz *- x\yz\xy — *- y\x 2 \yz 

^ ?r 
z\xy\xy 

x\y 2 \xz y\xz\xy ■< x\xy\yz y\yz\x 2 



z\x 2 \y 2 ' 

We assume furthermore x + min(a;, z) + 2y > 1 and we get Gg* 



(39) 



z\y 2 \x 2 



z\xy\xy 



x\y 2 \xz ■<— y\xz\xy ■ 



x\yz\xy y\x 2 \yz 



■ x\xy\yz 



Here is a second example with = a;|x 2 |y 2 where x + y > 1/2 and y < 0. 
Here Gg is of the 3+2 type and the subgraphs Gg and G* g * are respectively 



x\x \y ^y\xy\x 



y\xy\x 





~\y 2 \x 2 ■< y\x 2 \xy •< x\xy\xy x\y 2 \x 2 ■< y\x 2 \xy 



The third example with = z\x 2 \y 2 with y > x > 0, y>l and z = —x gives 
the two graphs G* 6 and Gg*. The graph G* e * has no cycle at all. 



■Ay 2 \x 2 

x\xz\y 2 y\xy\xz s- x\yz\xy y\x 2 \yz 



z\y 2 \x 2 



z\xy\xy 



x\y 2 \xz y\xz\xy ■ 



■ x\xy\yz^- y\yz\x 2 



y\xz\xy- 



' y\x 2 \yz 
\ 

■ x\xy\yz ^- y\yz\x 2 



z\x 2 \y 2 
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Remark. If 9, + 0, > for all 1 < i < j < 5 then G g = G* * . 



4.2 Random continued fractions attached to a path in Gg* 

The basic identity and the graphs. Let us fix 9 e R 5 and consider the di- 
rected graph Gg* — (V, E) To each vertex v € V is attached a distribution 73iJ„. 
To each edge (v, v') £ E is attached a pair of positive numbers corresponding to 
a /3 (2) distribution that we denote by f3 ( ~ v ' v '\ The basic identity (Theorem 3.3) 
says that if X <~ SiJt, and ~ @ <<v ' v ** are independent then j-rj^y ~ . 
In the sequel, for w > 0, we denote by H w the Moebius transformation 

= T~ — • 

1 + wx 

Proposition 4.1. Let v q — > i>i — > . . . — > t>„ be a path in Gg* of non necessarily 
distinct vertices. Let X ,Wi, . . . ,W n be independent random variables such 
that X - BH V0 and Wj - ^s-^i) for j = l,...,n. Define the random 
Moebius transformations Fj = Hw s . Then 

F n oF n _ 1 o...F 1 (X ) = - \f BH Vn . 

1 H w 2 

•■■+l + W 1 AT 

Proof. We proceed by induction on n. This is trivially true for n = 0. If it is 
true for n— 1 we apply the basic identity (Theorem 3.3) to the pair (v n -i,v n ).D 
Here is the simple theorem which can be considered as the main result of 
the present paper: 

Theorem 4.2. Let 

• • • v n -> v n -i -> . . . -> v 2 -> «i -> w (40) 

be an infinite path in G*,* and let Wi, . . . , W n , ... be independent random vari- 
ables such that W n <~ p( v n,vn-i)_ Define the random Moebius transformations 
Fj = and Z n (x) = F\ of 2 o . . . F„(a;). Then the random continued fraction 

Z = lim Z„(z) = — l — 

1 + : , w 7 . 

i+ l + W 3 ... 

associated to the infinite path almost surely exists and is independent of x. Its 
distribution is Z <~ BH Vo 

Proof. Since the graph Gg* is finite, there exists an edge (v, v') such that the 
set TV = {n; (v n ,v n -i) = (v,v')} is infinite. Since for all n G N all the W n are 
identically distributed we have ^2 neN V^ 7 " = 00 almost surely and therefore 
Y^Li 1/W n — oo almost surely. This is a classical criteria (see Chrystal (1964)) 
for claiming that Z = limn^oo Z n (x) exists and does not depend on x. Let 
now X be independent of W\, . . . , W n , . . . such that X ~ BH V . Let us apply 
Proposition 4.1 to Z n (X) when n E N: we get that Z n (X) <~ BH V() for all 
n e TV. We deduce from this that Z ~ 73iJ„ n . □ 
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Comments and examples. In Theorem 4.2 we have called the sequence ([4"u]) 
an infinite path in the graph Gq* which can also be written as 

V <- Vi i- v 2 <- ■ ■ . <- v n -i <- v n <- ... 

If one insists that an infinite path should be a map from N to the set of vertices 
of the graph and not a map on — N it would have been be more correct to say 
that we work with an infinite path in the graph where all arrows have been 
inverted. 

A simple example of application of Theorem 4.2 is the graph (|34p since 
obviously the sequence (v n ) is the constant x\x 2 \x 2 where x > 0. Here the 
distribution of Z is 

«//, , , , ,(,/:; Cz 2x '(! zf x 1 2 Fi(2x,2a:;4aj;«)l (0il) («)dz 

Examples for x = 1 /2 and x = 1 have been given in (|28p and (|29p . 

Another example of application of Theorem 4.2 is the graph that we 
present in a simpler way as 

— (41) 




c 



where a = x|x 2 |a;?/, 6 = x|a:?;|a; 2 , c = y\x 2 \x 2 . It is easily seen that x + y,x > 
implies Gg* = Gg. To any infinite word of {0, 1} we associate an infinite path 
vq ^ vi ^ . . . ^r- v n -i <— v n ■<—... in this graph as follows: each one is replaced 
by b <— c <— a <— and each zero is replaced by b a <— . For instance the word 
00101 .. . gives an infinite path ending at b as 

fe-^a-s— a-^6-^c<— a-s— 6 a <— c<— a <- . . . . 

Theorem 4.5 says that whatever is the infinite word of {0, 1} the distribution 
of the random continued fraction Z corresponding to the infinite path in the 
graph (|41[) deduced from this word is 

BH b (dz) = Cz 2x -\l - z) 2x - x 2 Fi(2x, 2x; 3x + y; z)\ Qtl) (z)dz. 



A Cauchy distribution analogy. If to = a + ib with b > 0, consider the 
Cauchy distribution C w (dx) — ^ b 2 +\x^-a) 2 ■ Now let Wx, . . . , W n , ... be indepen- 
dent random variables such that W n ~ C Wn . Assume that J2^Li l^nl -1 = 00 • 
Define 

Then Z ~ G z . We find some analogy between this elegant statement (due to 
Lloyd (1969) in the particular case of a constant sequence and Theorem 

4.2: here we consider the Moebius transformations F w (x) = w — \ and we are 
given an arbitrary infinite word W\W% . . . . It leads to the exact distribution G 2 
of 

Z = lim ° Fw 2 ° • ■ • Fw n ( x )- 
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4.3 Markov chains attached to a cycle in G* e * 

A consequence of Theorem 4.2 is about the stationary distribution of some 
Markov chains. It shows the importance of cycles in the graphs G* g * . Let us 
mention that Theorem 5.1 below will prove that the cycles of G* g . and G g * 
coincide. 

Theorem 4.3. Let vo —¥ v\ — > . . . Vk-i — > v be a cycle of order k in Gg*; let 
Wj <~ p( v 3-i> v i) with j = 1, 2, ... be independent with the convention vj — vy 
if j = j' mod k. Consider the random Moebius transformation 

G n {x) = H Wnk o H Wnk _ 1 0...0 H W(n _ 1)k+1 (x) 

and the homogeneous Markov chain (X n (x))%L on (0, 1) defined by X n (x) = 
x > and 

X n (x) = G n (X n -i(x)) = G n o G n -\ o . . . o Gi{x). 

Under these circumstances the stationary distribution of the chain is unique and 
is BH Vo . 

Proof. Consider Z n (x) — G\ o Gi o . . . o G n (x). Consider the infinite path 

Vo <~ Vk <- Vk-l V\ <- V <- Vk <- Vk-l <- ... 

Theorem 4.2 shows that Z = lim^oo Z n (x) exists almost surely and that Z ~ 
BH Vo . From Letac (1986), from Chamayou and Letac (1991) or from Propp 
and Wilson (1995) we get that the stationary distribution of the Markov chain 
(X n (x))^ =a is unique and it is the distribution of Z. □ 

Comments AND examples. For other random continued fractions there are 
some analogues of Theorem 4.3 in literature with cycles only of size 1 or 2: see 
Letac and Seshadri (1984), Lloyd (1959), Dyson (1953), Marklov, Tourigny and 
Wolovski (2008) and Asci et al. (2008) that we have mentioned in Section 4.2. 

The germane example of the present paper is in Asci et al. (2008) where 
the two BH distributions BH(a,b,a,b 7 a + a') and BH(a' ,b,a' , 6, a + a') are 
considered. Their 9 parameterizations are 

2'i-2' a ~~ ^b{^> a ~ 2!' 2 »f 2 ,a ~ 2 } ' { 2 ,a ~ 2} 

These 0's are of 3 + 1 + 1 type. With x = b/2, y = (2a - b)/2 and z = {2a 1 - b) /2 
the vertices x\xz\xy and x\xy\xz are the vertices of the unique cycle of order 2 
of the graph 3 + 1 + 1. The parameters of the beta type two random variable 
W used by the random Moebius transformation Hw sending the first law into 
the second are thus (b, a) (and (6, a') for the opposite). 

In Section 4.1 we have mentioned the existence of cycles of all sizes between 
1 and 30 (except 11, 21, 25, 27, 28, 29). Each of these cycles is associated to 
explicit distributions of periodic random continued fractions and stationary mea- 
sures of Markov chains. In particular we give the example of the homogeneous 
Markov chain G n o G n -i o . . . o G\(z) with stationary distribution BH XiViZiUtV , 
where the random Moebius transformations Gk are iid with 

G\ — H W30 o . . . H Wl 
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with the Wi's independent with f3^ distribution with parameters (increasing 
from i = 1 to i = 30) 
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We should also mention here that if 



is a non singular real matrix and if /im(^) = (ax + b) / (cx + d) is the correspond- 
ing Moebius transformation we have necessarily Km ° h-M' = Hmm 1 - Therefore 
the above results could be interpreted in terms of random walks on the group 
GL(2,R) suitably quotiented since h\M — h\i for any non zero scalar A. 

5 Cycles and positivity 

Let G = Gg be one of the seven graphs. In this section we show the delicate 
result that the cycles of Gg are always the cycles of Gg* . Since the existence of 
p(v,v ) j g eas j er t check than the existence of BH V , Theorem 5.1 happens to be 
useful and practical. Use again the notation 

v e = (0i,{02,03},{04,0 5 }) 

and suppose that vq = vg is a vertex of G. If (vq, v) is an edge of the graph recall 
that either v = Vl = (M^iAM^a}) OT v = v 2 = (9 4 , {6 U 9 5 }, {9 2 ,9 3 }) 
and that (vq, vi) is said to be acceptable if 82 + 9 3 > and 8\ + 85 > 0. 

Theorem 5.1. Let 8 — {81,82,83,84,85) e M 5 and consider the vertex vq = vg 
of G* e defined by ([5]). We assume without loss of generality 8 2 < 83 and 84 < 8 5 . 
Then vq belongs to a cycle of Gg if and only if 8i + 9j > for all 1 < i < j < 5 
except possibly for = (2,4). Furthermore if vq belongs to a cycle of Gg 

then vq € O, which means that the distribution BH V[) exists and vq is a vertex 
of G* g *. Finally if 8 2 + 84 < then 

V -> V 1 — >• V 2 -> U3 -> «4 — >• W 5 — >• U 

where 

Ui = (0 5 , {^4,^i}, {02,e 3 }), V2 = (8 3 ,{8 2 ,8 5 },{84,8 1 }), v 3 = (e 1 ,{e 4 ,e 3 },{8 2 ,9 5 }), 
V4 = (8 5 , {82,8^, {84,83}), v 5 ^ (83, {84,85}, {82,8^) (42) 

and min{^i, 03, ^5} > — max{ 82,84}. In particular in this case belongs to a 
cycle of order 1,2,3 or 6. 
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Lemma 5.2. In the graph G, let V- 2 — > u_i — > vo — > «i — > «2 such that the four 
edges {(ufc,w fc+ i); fc = -2, -1, 0, 1} are acceptable. Ifw = (8x, {82,83}, {#4, #5}) 
then the six following numbers 

01 + 62, 01 + 3 , 62 + 63, 61 + 84, 6\ + 05, 64 + 85 

are positive (we do not assume 82 < 83 and 84 < 85 here). 

Proof of Lemma 5.2. The edge (vo,vx) being acceptable we get 82 + 83 > 0. 
The edge Vq) being acceptable we get 84+85 > O.Ifwi = (#4, {0i, #5}, {62, 83}) 
the fact that (vq,Vi) is an edge implies 81+84 > and the fact that (vx,v 2 ) 
is an edge implies 81 + 8 5 > 0. Similarly Vx = (#5, {0i, 9%}, {8 2 , 83}) implies 
6»i + 84 > and 6>i + 85 > as well. Finally if u_i = (6» 3 , {84, 8 5 }, {8 X ,8 2 }) the 
fact that (v-x, vq) is an edge implies 9% + 8 2 > and 0i + 03 > 0. Similarly if 
v-i = (02, {#4, #5}, {0i, 03}) implies X + 2 > and 0i + 3 > and the lemma 
is proved. □ 

Proof of Theorem 5.1. Suppose that vo is in a cycle of G* g . Therefore the 
Lemma 5.2 is applicable to vq. Recall now that in the statement of the theorem 
we have assumed 02 < 03 and 84 < 5 . If vi — (04, {0i, 0s}, {02, 03}) , since 
(^2,^3) is an edge we get the new inequality 02 + 04 > 0. Similarly if v± — 
{95,{9i,9 2 },{9 2 ,9 3 }) implies 2 + 5 > 0. Since 4 < 5 the inequality 2 + 5 > 
holds for both possible values of v\. 

At this point observe that the only inequality to prove now is 3 + 04 > 0, 
which imply the other one 3 + 0s > 0. We discuss again the two possible values 
of v\ for applying Lemma 5.2 to the sequence V-\ — > vq — > v\ — > V2 — > w 3 . If 
V\ = (04, {0i, 5 }, {02, 03}) we have seen that 2 +04 > which imply 3 +0 4 > 0. 
Therefore the result is proved in this case. 

Now we assume vi = (05, {0i, 9 2 }, {02, #3}) and we discuss according to the 
two possible values of v 2 . If i>2 = (02, {03, 0s}, {0i, 04}) we apply Lemma 5.2 
to the sequence vq — > Vi — > «2 -+ ^3 -+ V4. Among the six inequalities we find 
02 + 04 > which implies 3 + 4 > 0. The last case is v 2 = (03, {02, 0s}, {8x, #4}): 
among the inequalities given by Lemma 5.2 we find the desired one 3 + 04 > 0. 
Finally from (|26|) BH V always exists if v is in a cycle of G* g since in this case all 
the traditional parameters a, b,p, q, r are positive. 

One can observe from the preceeding study that if 02+04 < then necessarily 
Vx = (05, {01, 02}, {9 2 , 9 3 }) and v 2 = (03, {02, 0s}, {9x, 8a})- Iterating this remark 
we see that the set (vq, v x , v 2 , V3, V4, V5) described in ((32) is a cycle. 

We now discuss the converse. Suppose that 0j + 0j > for all 1 < i < j < 5. 
In this case G = G* g and since any vertex of G is the initial vertex and the end 
vertex of some arrows, any vertex of G belongs to a cycle. Now suppose that 
8i + 8j > for all 1 < i < j < 5 except for = (2,4). We have described 
above the corresponding cycle. For seeing that min{0i, 3 , 5 } > max{0 2 ,04} 
observe that we have seen that in an element of the cycle 8i + 8j < can happen 
only for one pair (i,j). For instance for we had (i,j) = (2,4) and therefore 
02 < 03 and 04 < 05 must be strict inequalities. Now making the same remark 
for the other («i, v 2 , V3, V4, ^5) of (|4"2")l shows min{0i, 03, 85} > max{02,04}. □ 

Cycles when 2 + 04 < 0. Assume as in Theorem 5.1 that 2 < 3 and 4 < 5 
and that 02 + 04 < 0. In this case vq as in Theorem 5.1 can belong to a cycle of 
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size 1,2,3 or 6. Let us indicate here without proof the necessary and sufficient 
conditions for this. This is done by analyzing (l4"2"j) . 

• Vo is in a cycle of size 1 if and only if vq = x\xy\xy with x + y > and 

y <o. 

• Vo is in a cycle of size 2 if and only if vq = x\xy\xz with x + y, x + z > 0, 
y + z < and y =/= z. 

• t>o is in a cycle of size 3 if and only if vq = y\xz\xu with x+y, x+z, x+u > 0, 
x < and y,z,u are not all equal. 

• wo is in a cycle of size 6 if and only if vq = x\yz\uv with y + u < 0, all the 
other sums of pairs are positive y + 1 u and x, z, u are not all equal. 

Example. Let us use the identity 2-Fi(p> — p; 1/2; sin 2 0) = cos2p0. It provides 
a hypergeometric function which is certainly positive on (0, 1) if < p < 1/2. 
Consider the distribution BH(2a, b,p, —p, 1/2) for a, & > 0. From (|27p it is equal 
to BHg with 

0\ = a + fe, 02 = — ck, ^3 = —a + — , 04 = a — p, 65 = a + p 

Note that O2 < O3, O4 < 85 and 02 + #4 = — P < 0. It is easy to detect with the 
help of Theorem 5.1 that that vg belongs to a cycle of Gg if and only if 1/4 > a 
and b + 2a > p, with < p < 1/2. Since 62 + O4 < this cycle is described by 
63). 

A cycle of order 6 changed in A cycle of order 5. The above analysis 
includes 82 + 84 = 0. In this case BH Vj for j = 0, . . . , 5 are beta distributions or 
quasi beta distributions. Since in this case the same distribution has an infinite 
number of 8 parameters, it makes sense to ask if these different representations 
could appear within a cycle. This happens only in the following case. Taking 
v = (x,y,x + 2y,-y,x) with x > y > we get v x = (x,-y,x,y,x + 2y), 
v 2 = (x + 2y, y, x, -y, x), v 3 = (x, -y, x + 2y, y, x), v 4 = (x, y, x, -y, x + 2y), 
v 5 = (x + 2y, -y, x, y, x) and 

BH Vo flx — y : x-\-y> BH Vl /3x+3y,x — y? BH V2 — Px—y,x-k-3yi 
BH V3 — flx+y ,x— yi BH VA /3 x +y ^x+y , BH V ^ — /3 x +y , x -\-y • 

We observe that BH Vi = BH V5 . All the other BH V are different. Therefore, 
starting with a cycle of order 6 we can design a beta distributed random con- 
tinued fraction of the type of Theorem 4.3 with period k = 5. One can prove 
that this phenomenon appears only for the above choice of parameters. 
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